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These should be interpreted as being references to the corresponding ETSI deliverables. The mapping of document 
identities is as follows: 

For 3GPP documents: 
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ETSI TS I TR Inn nnn "[Digital cellular telecommunications system (Phase 2+) (GSM);] Universal Mobile 
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Foreword 



id , 



This Technical Specification has been produced by the 3 r Generation Partnership Project, Technical Specification 
Group Services and System Aspects, Working Group 4 (Codec). 

The contents of this informal TS may be subject to continuing work within the 3GPP and may change following formal 
TSG-S4 approval. Should TSG-S4 modify the contents of this TS, it will be re-released with an identifying change of 
release date and an increase in version number as follows: 

Version m.t.e 

where: 

m indicates [major version number] 

x the second digit is incremented for all changes of substance, i.e. technical enhancements, corrections, 
updates, etc. 

y the third digit is incremented when editorial only changes have been incorporated into the specification. 
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1 Scope 



This Technical Standard (TS) contains an electronic copy of the ANSTC code for the Adaptive Multi-Rate codec. The 
ANSTC code is necessary for a bit exact implementation of the Adaptive Multi Rate speech transcoder (TS 26.090 [2]), 
Voice Activity Detection (TS 26.094 [6]), comfort noise (TS 26.092 [4]), source controlled rate operation (TS 
26.093 [5]) and example solutions for substituting and muting of lost frames (TS 26.091 [3]). 



Normative references 



This TS incorporates by dated and undated reference, provisions from other publications. These normative references 
are cited at the appropriate places in the text and the publications are listed hereafter. For dated references, subsequent 
amendments to or revisions of any of these publications apply to this TS only when incorporated in it by amendment or 
revision. For undated references, the latest edition of the publication referred to applies. 

[1] TS 26.074 : "AMR Speech Codec; Test sequences". 

[2] TS 26.090 : "AMR Speech Codec; Speech transcoding". 

[3] TS 26.091 : "AMR Speech Codec; Substitution and muting of lost frames". 

[4] TS 26.092 : "AMR Speech Codec; Comfort noise aspects". 

[5] TS 26.093 : "AMR Speech Codec; Source controlled rate operation". 

[6] TS 26.094 : "AMR Speech Codec; Voice Activity Detection" 



3 Definitions and abbreviations 

3.1 Definitions 

Definition of terms used in the present document, can be found in TS 06.090 [2], TS 06.091 [3], TS 06.092 [4], 
TS 06.093 [5] and TS 06.094 [6]. 

3.2 Abbreviations 

For the purpose of the present document, the following abbreviations apply: 

ANSI American National Standards Institute 

ETS European Telecommunication Standard 

GSM Global System for Mobile communications 

I/O Input/Output 

RAM Random Access Memory 

ROM Read Only Memory 



C code structure 



This clause gives an overview of the structure of the bit-exact C code and provides an overview of the contents and 
organization of the C code attached to this document. 

The C code has been verified on the following systems: 

Sun Microsystems workstations and GNU gcc compiler; 

DEC Alpha workstations and GNU gcc compiler; 
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IBM PC/AT compatible computers with Linux operating system and GNU gcc compiler; 
ANSI-C 9899 was selected as the programming language because portability was desirable 

4.1 Contents of the C source code 

The C code distrubution has all files in the root level. 

The distributed files with suffix "c" contain the source code and the files with suffix "h" are the header files. The ROM 
data is contained mostly in files with suffix "tab". 

The C code distribution also contains one speech coder installation verification data file, "spch_dos.inp". The reference 
encoder output file is named "spch_dos.cod", the reference decoder input file is named "spch_dos.dec" and the 
reference decoder output file is named "spch_dos.out". These four files are formatted such that they are correct for an 
IBM PC/AT compatible computer. The same files with reversed byte order of the 16 bit words are named 
"spch_unx.inp", "spch_unx.cod", "spch_unx."dec" and "spch_unx.out", respectively. 

Final verification is to be performed using the GSM Adaptive Multi-Rate test sequences described in GSM 06.74 [2]. 

Makefiles are provided for the platforms in which the C code has been verified (listed above). Once the software is 
installed, this directory will have a compiled version of encoder and decoder (the bit-exact C executables of the speech 
codec) and all the object files. 

4.2 Program execution 

The GSM Adaptive Multi-Rate codec is implemented in two programs: 

(encoder) speech encoder; 

(decoder) speech decoder. 

The programs should be called like: 

encoder [encoder options] <speech input file> <parameter file> 

decoder [decoder options] <parameter file> <speech output file> 

The speech files contain 16-bit linear encoded PCM speech samples and the parameter files contain encoded speech 
data and some additional flags. 

The encoder and decoder options will be explained by running the applications with option -h. See the file readme.txt 
for more information on how to run the encoder and decoder programs. 



4.3 Coding style 



The C code is written according to the following structuring conventions. Each function func() that needs static 
variables is considered a module. A module consists of: 

a 'state structure' (struct) combining the static variables of the module 

- three auxiliary functions func_init(), func_reset(), and func_exit(). 

the processing function func() itself 

The initialization function func_init() allocates (from the heap) a new state structure, calls the func_reset() function, 
stores the pointer to the newly allocated structure in its first function parameter, and returns with a value of if 
completed successful or a value of 1 otherwise. 

The reset function func_reset() takes a pointer to the state structure and resets all members of the structure to a 
predefined value ('homing'). 



ETSI 



(3G TS 26.073 version 3.0.0 Release 1 999) 7 ETSI TS 1 26 073 V3.0.0 (2000-01 ) 



The exit function func_exit() performs any necessary cleanup and frees the state structure memory. 

The processing function func() also takes a pointer to the state structure as well as all other necessary parameters and 
performs its task using (and possibly modifying) the values in the state structure. 

If a module calls other modules, the higher level state structure contains a pointer to the lower level state structures, and 
the init, reset, and exit functions recursively call the corresponding lower level functions. 

By this convention, the code becomes "instantiable" (more than one copy of a module can be used in the same program) 
and the static data hierarchy is clearly visible in the code. 



4.4 Code hierarchy 



Figures 1 to 4 are call graphs that show the functions used in the speech codec, including the functions of V AD, DTX, 
and comfort noise generation. 

Each column represents a call level and each cell a function. The functions contain calls to the functions in rightwards 
neighboring cells. The time order in the call graphs is from the top downwards as the processing of a frame advances. 
All standard C functions: printf(), fwrite(), etc. have been omitted. Also, no basic operations (add(), L_add(), mac(), 
etc.) or double precision extended operations (e.g. L_Extract()) appear in the graphs. The initialization of the static 
RAM (i.e. calling the _init functions) is also omitted. 

The basic operations are not counted as extending the depth, therefore the deepest level in this software is level 7. 

The encoder call graph is broken down into three separate call graphs, Table 1 to 3. 
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Table 1 : Speech encoder call structure 



Speech_Encode_Frame 


Pre Process 










cod amr 


Copy 




Vad1 1 


filter_bank 


first_filter_stage 




filter5 




filter3 




level calculation 




vad decision 


complex estimate adapt 




complex vad 




noise estimate update 


update cntrl | 


hangover addition 




Vad2 1 


block_norm 
r fft 

fn10Log10 
Pow2 


c fft 
Log2 










Log2 norm 






tx dtx handler 




Ipc 


Autocorr 




Lag window 




Levinson 




Isp 


Az Isp 


Chebps 




Q_plsf_5 


Lsp Isf 




Lsf wt 




Vq subvec 




Vq subvec s 




Reorder Isf 




Lsf Isp 




Int Ipc 1and3 2 


Lsp az 


Get lsp_pol 


Int Ipc 1and3 


Lsp az 


Get Isp pol 


Q_plsf_3 


Lsp Isf 




Lsf wt 




Copy 




Vq subvec3 




Vq subvec4 




Reorder Isf 




Lsf Isp 




Int Ipc 1to3 2 


Lsp az 


Get lsp_pol 


Int Ipc 1to3 


Lsp az 


Get lsp_pol 


Copy 






dtx buffer 


Copy 




Log2 


Log2 norm 




dtx enc 


Lsp Isf 






Reorder Isf 




Lsf Isp 




Set zero 






lsp_reset 


Copy 




Q plsf reset 




cl Itp reset 


Pitch fr reset 




check Isp 






pre_big 


Weight Ai 




Residu 




Syn filt 




oljtp 


Pitch_ol 


vad tone detection update 2 




Lag_max 


vad tone detection 2 


Inv sqrt 


comp corr 2 




hp max 2 




vad complex detection update 2 




Pitch_ol_wgh 


comp corr 2 




Lag_max 2 


vad tone detection update 2 


vad tone detection 2 


gmed n 




hp max 2 




vad complex detection update 2 




vad_pitch_detection 


LTP flag update 3 






subframePreProc 


Weight Ai 




Syn filt 




Residu 




Copy 




cljtp 


Pitch fr 

(continued) 


getRange 




Norm Corr 


Convolve 


Inv sqrt 


search Frac 


Interpol 3or6 


Enc Iag3 




Enc Iag6 











1 Option to call one or the other VAD option 

2 Specific to VAD option 1 

3 Specific to VAD option 2 
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Table 1 (concluded): Speech encoder call structure 









Pred It 3or6 


Convolve 


G pitch 


check gp clipping 


q gain pitch 


cbsearch 


see Table 2 


gainQuant 


see Table 3 


update gp clipping 


Copy 


subframePostProc 


Syn tilt 


Pred It 3or6 




Convolve 




Prm2bits 


Int2bin 





Table 2: cbsearch call structure 



cbsearch 


code 2i40 9bits 


cor h x 




set sign 




cor h 


Inv sqrt 


search 2i40 




build code 




code_2i40_1 1 bits 


cor h x 




set sign 




cor h 


Inv sqrt 


search 2i40 




build code 




code_3i40_14bits 


cor h x 




set sign 




cor h 


Inv sqrt 


search 3i40 




build code 




code 4i40 17bits 


cor h x 




set sign 




cor h 


Inv sqrt 


search 4i40 




build code 




code 8i40 31 bits 


cor h x 




set sign12k2 


Inv sqrt 


cor h 


Inv sqrt 


search 10and8i40 




build code 




compress code 


compresslO 


code_1 0i40_35bits 


cor h x 




set sign12k2 


Inv sqrt 


cor h 


Inv sqrt 


search 10and8i40 




build code 




q_p 





Table 3: gainQuant call structure 



gainQuant 


gc pred copy 


Copy 






gc_pred 


Log2 


Log2 norm 




Log2 norm 






calc tilt energies 






calc target energy 




MR475 update unq pred 


gc pred update 




MR475_gain_quant 


MR475_quant_store_results 


Log2 


Log2 norm 


gc pred update 




gc_pred 


Log2 


Log2 norm | 


Log2 norm 




G code 








q gain code 


Pow2 




MR795_gain_quant 


q gain pitch 




MR795 gain code quant3 




calc unfilt energies 


Log2 


Log2 norm | 


gain adapt 


gmed n 




MR795 gain code quant mod 


sqrt 1 exp 




Qua gain 


Pow2 






gc_pred_update 
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Table 4: Speech decoder call structure 



Speech_Decode_Frame 



Bits2prm 



Decode r_amr 



Post_Filter 



Post_Process 



rxdtx handler 



Decode r_amr_reset 



Lsfjsp 



lsp_avg 



Copy 



D_plsf_3 



lnt_lpc_1to3 



D_plsf_5 



Int_lpc_1and3 



Dec_lag3 



Dec_lag6 



decode_2i40_9bits 



decode_2i40_11bits 



decode_3i40_14bits 



decode_4i40_1 7bits 



decode_8i40_31bits 



ec_gain_pitch 



d_gain_pitch 



ec_gain_pitch_update 



decode_10i40_35bits 



Dec_gain 



ec_gain_code 



ec_gain_code_update 



d_gain_code 



Intjsf 



Cb_gain_average 



ph_disp_release 



ph_disp_lock 



ph_disp 



sqrt_l_exp 



agc2 



Synjilt 



Bgn_scd 



dtx_dec_activity_update 



lsp_avg 



Copy 



Weight_Ai 



Residu 



Set_zero 



Synjilt 



Preemphasis 



age 



lsp_avg_reset 



D_plsf_reset 



ec_gain_pitch_reset 



ec_gain_code_reset 



gc_pred_reset 



Bgn_scd_reset 



ph_disp_reset 



dtx_dec_reset 



Copy 



Lsfjsp 



lnit_D_plsf_3 



D_plsf_3 



pseudonoise 



Lspjsf 



Lsp_Az 



A_Refl 



Log2 



Build_CN_code 



Synjilt 



Copy 



Copy 



Reorder Isf 



Copy 



Lsfjsp 



Get_lsp_pol 



Log2_norm 



pseudonoise 



Reorder Isf 



Copy 



Lsfjsp 



Lsp_Az 



Reorder Isf 



Copy 



Lsfjsp 



Lsp_Az 



GetJsp_pol 



GetJsp_pol 



decompress_code 



gmed_n 



decompresslO 



Log2 



gc_pred 



Pow2 



gc_pred_update 



gmed_n 



gc_pred_average_limeted 
gc_pred_update 



Log2_norm 



Log2 



Log2_norm 



Log2_norm 



gc_pred 



Pow2 



gc_pred_update 



Log2 



Log2_norm 



Log2_norm 



gmed_n 



lnv_sqrt 



gmed_n 



Copy 



Log2 



Log2_norm 



energy_old 



energy_new 



nv_sqrt 



energy_old 
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4.5 



Variables, constants and tables 



The data types of variables and tables used in the fixed point implementation are signed integers in 2's complement 
representation, defined by: 

Wordl6 16 bit variable 
Word32 32 bit variable 



Furthermore some enum types are used, all possible to represent with one byte, and a boolean Flag. 

4.5.1 Description of constants used in the C-code 

This section contains a listing of all global constants defined in cnst.h. 

Table 5: Global constants 



Constant 


Value 


Description 


L TOTAL 


320 


total size of speech buffer. 




L WINDOW 


240 


window size in LP analysis 




L FRAME 


160 


frame size 




L FRAME BY2 


80 


frame size divided by 2 




L SUBFR 


40 


subframe size 




L CODE 


40 


codevector length 




NB TRACK 


5 


number of tracks 




STEP 


5 


codebook step size 




NB TRACK MR102 


4 


number of tracks mode mr1 02 




STEP MR102 


4 


codebook step size mode mr1 02 




M 


10 


order of LP filter 




MP1 


(M+1) 


order of LP filter + 1 




LSF GAP 


205 


minimum distance between LSF after quantization 


50 Hz = 205 


LSP PRED FAC MR122 


21299 


MR122 LSP prediction factor (0.65 Q15) 




AZ SIZE 


44 


size of array of LP filters in 4 subframes (4*M+4) 




PIT MIN MR122 


18 


minimum pitch lag (MR122 mode) 




PIT MIN 


20 


minimum pitch lag (all other modes) 




PIT MAX 


143 


maximum pitch lag 




L INTERPOL 


(10+1) 


length of filter for interpolation 




L INTER SRCH 


4 


length of filter for CL LTP search interpolation 




MU 


26214 


factor for tilt compensation filter 0.8 




AGC FAC 


29491 


factor for automatic gain control 0.9 




L NEXT 


40 


overhead in LP analysis 




SHARPMAX 


13017 


maximum value of pitch sharpening 




SHARPMIN 





minimum value of pitch sharpening 




MAX PRM SIZE 


57 


max. num. of params 




MAX SERIAL SIZE 


244 


max. num. of serial bits 




GP CLIP 


15565 


pitch gain clipping = 0.95 




N FRAME 


7 


old pitch gains in average calculation 




EHF MASK 


8 


16 bit representation of all samples in the encoder 
justification) 


homing frame (left 



4.5.2 Description of fixed tables used in the C-code 

This section contains a listing of all fixed tables sorted by source file name and table name. All table data is declared as 
Wordl6. 
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Table 6: Fixed tables 



File 



Table name 



Length 



Description 



c2_9pf.c 


trackTable 


4*5 


cod_amr.c 


gammal 


10 


cod_amr.c 


gamma1_12k2 


10 


cod_amr.c 


gamma2 


10 


dtx_dec.c 


lsf_hist_mean_scale 


10 


dtx_dec.c 


dtx_log_en_adjust 


9 


ec_gains.c 


cdown 


7 


ec_gains.c 


pdown 


7 


gc_pred.c 


pred 


4 


gc_pred.c 


pred_MR122 


4 


pitchjr.c 


mode_dep_parm 


72 


post_pro.c 


a 


3 


post_pro.c 


b 


3 


pre_proc.c 


a 


3 


pre_proc.c 


b 


3 


pred_lt.c 


inter_6 


61 


pstfilt.c 


gamma3_MR122 


10 


pstfilt.c 


gamma3 


10 


pstfilt.c 


gamma4_MR122 


10 


pstfilt.c 


gamma4 


10 


bitno.tab 


prmno 


9 


bitno.tab 


prmnofsf 


8 


bitno.tab 


bitno 


9 


bitno.tab 


bitno_MR475 


17 


bitno.tab 


bitno_MR515 


19 


bitno.tab 


bitno_MR59 


19 


bitno.tab 


bitno_MR67 


19 


bitno.tab 


bitno_MR74 


19 


bitno.tab 


bitno_MR795 


23 


bitno.tab 


bitno_MR102 


39 


bitno.tab 


bitno_MR122 


57 


bitno.tab 


bitno_MRDTX 


5 


c2_11pf.tab 


startPosI 


2 


c2_11pf.tab 


startPos2 


4 


c2_9pf.tab 


startPos 


16 


corrwght.tab 


corrweight 


251 


d_homing.tab 


dhf 


8 


d_homing.tab 


dhf_MR475 


17 


d_homing.tab 


dhf_MR515 


19 


d_homing.tab 


dhf_MR59 


19 


d_homing.tab 


dhf_MR67 


19 


d_homing.tab 


dhf_MR74 


19 


d_homing.tab 


dhf_MR795 


23 


d_homing.tab 


dhf_MR102 


39 


d_homing.tab 


dhf_MR122 


57 


gains.tab 


qua_gain_pitch 


16 


gains.tab 


qua_gain_code 


96 


gray, tab 


gray 


8 


gray, tab 


dgray 


8 


grid. tab 


grid 


61 


inter_36.tab 


inter_6 


25 


inv_sqrt.tab 


table 


49 


lag_wind.tab 


lag_h 


10 


lag_wind.tab 


lagj 


10 



track table for algebraic code book search (MR475, MR51 5) 

spectral expansion factors 

spectral expansion factors 

spectral expansion factors 

initialization values for DTX Isf parameters 

level adjustments for ech mode 

attenuation factors for codebook gain 

attenuation factors for adaptive codebook gain 

algebraic code book gain MA predictor coefficients 

algebraic code book gain MA predictor coefficients (MR122) 

parameters defining the adaptive codebook search per mode 

HP filter coefficients (denominator) in Post_Process 

HP filter coefficients (numerator) in Post_Process 

HP filter coefficients (denominator) in Pre_Process 

HP filter coefficients (numerator) in Pre_Process 

interpolation filter coefficients 

spectral expansion factors 

spectral expansion factors 

spectral expansion factors 

spectral expansion factors 

number of bits for each mode 

number of parameters for LPC and first subframe for each mode (used for decoder 

homing procedure) 

pointers to the bitno_MR... tables 

number of bits per parameter to transmit (MR475) 

number of bits per parameter to transmit (MR515) 

number of bits per parameter to transmit (MR59) 

number of bits per parameter to transmit (MR67) 

number of bits per parameter to transmit (MR74) 

number of bits per parameter to transmit (MR795) 

number of bits per parameter to transmit (MR102) 

number of bits per parameter to transmit (MR122) 

number of bits per parameter to transmit (MRDTX) 

track start search position for first pulse 

track start search position for second pulse 

track start search position 

weighting of the correlation function in open loop LTP search (MR102) 

pointers to the dhf_MR... tables 

parameter values for the decoder homing frame (MR475) 

parameter values for the decoder homing frame (MR515) 

parameter values for the decoder homing frame (MR59) 

parameter values for the decoder homing frame (MR67) 

parameter values for the decoder homing frame (MR74) 

parameter values for the decoder homing frame (MR795) 

parameter values for the decoder homing frame (MR102) 

parameter values for the decoder homing frame (MR122) 

adaptive codebook gain quantization table (MR122, MR795) 

fixed codebook gain quantization table (MR122, MR795) 

gray coding table 

gray decoding table 

grid points at wich Chebyshev polynomials are evaluated 

interpolation filter coefficients 

table used in inverse square root computation 

high part of the lag window table 

low part of the lag window table 
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(continued) 



Table 6 (concluded): Fixed tables 



File 


Table name 


Length 


Description 


log2.tab 


table 


33 


table used inbase 2 logharithm computation 


Isp.tab 


lsp_init_data 


10 


initialization table for Isp history in DTX 


Ispjsf.tab 


table 


65 


table to compute cos(x) in Lsf_lsp() 


Ispjsf.tab 


slope 


64 


table to compute acos(x) in Lsp_lsf() 


ph_disp.tab 


ph_imp_low_MR795 


40 


phase dispersion impulse response (MR795) 


ph_disp.tab 


ph_imp_mid_MR795 


40 


phase dispersion impulse response (MR795) 


ph_disp.tab 


ph_imp_low 


40 


phase dispersion impulse response (MR475 - MR67) 


ph_disp.tab 


ph_imp_mid 


40 


phase dispersion impulse response (MR475 - MR67) 


pow2.tab 


table 


33 


table used in 2 to the power computation 


q_plsf_3.tab 


past_rq_init 


80 


initialization table for the MA predictor in DTX 


q_plsf_3.tab 


mean_lsf 


10 


LSF means (not in MR122) 


q_plsf_3.tab 


predjac 


10 


LSF prediction factors (not in MR122) 


q_plsf_3.tab 


dicoljsf 


3*256 


1 st LSF quantizer (not in MR122 and MR795) 


q_plsf_3.tab 


dico2_lsf 


3*512 


2 nd LSF quantizer (not in MR122) 


q_plsf_3.tab 


dico3_lsf 


4*512 


3 rd LSF quantizer (not in MR122, MR515 and MR475) 


q_plsf_3.tab 


mr515_3_lsf 


4*128 


3 rd LSF quantizer (MR515 and MR475) 


q_plsf_3.tab 


mr795_1_lsf 


3*512 


1 st LSF quantizer (MR795) 


q_plsf_5.tab 


meanjsf 


10 


LSF means (MR1 22) 


q_plsf_5.tab 


dicoljsf 


4*128 


1 st LSF quantizer (MR1 22) 


q_plsf_5.tab 


dico2_lsf 


4*256 


2 nd LSF quantizer (MR1 22) 


q_plsf_5.tab 


dico3_lsf 


4*256 


3 rd LSF quantizer (MR1 22) 


q_plsf_5.tab 


dico4_lsf 


4*256 


4 th LSF quantizer (MR1 22) 


q_plsf_5.tab 


dico5_lsf 


4*64 


5 th LSF quantizer (MR1 22) 


qgain475.tab 


table_gain_MR475 


4*256 


gain quantization table (MR475) 


qua_gain.tab 


table_gain_highrates 


128*4 


gain quantization table (MR67, MR74 and MR102) 


qua_gain.tab 


table_gain_lowrates 


64*4 


gain quantization table (MR515 and MR59) 


RJft.c 


phsjbl 


128 


sine/cosine phase table 


RJft.c 


ii_table 


8 


indexing table 


sqrt_l 


table 


49 


table to compute sqrt(x) 


Vad1 .c 


chjbl 


2*16 


channel energy combination table 


Vadl.c 


ch_tbl_sh 


16 


channel energy scaling table 


Vadl.c 


vm_tbl 


90 


voice metric table 


Vadl.c 


hangover_table 


20 


used to determine hangover as a function of SNR 


Vadl.c 


burstcounMable 


20 


used to determine burst count threshold as a function of SNR 


Vadl.c 


vm_thresh_table 


20 


used to determine the voice metric threshold as a function of SNR 


Vadl.c 


energy state tables 


2*6 


constants as a function of scaling state 


window.tab 


window_200_40 


240 


LP analysis window (not in MR122) 


window.tab 


window_1 60_80 


240 


1 st LP analysis window (MR122) 


window.tab 


window_232_8 


240 


2 nd LP analysis window (MR1 22) 



4.5.3 Static variables used in the C-code 

In this section two tables that specify the static variables for the speech encoder and decoder respectively are shown. All 
static variables are declared within a C struct. 
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Table 7: Speech encoder static variables 



Struct name 


Variable 


Type[Length] 


Description 


Speech_Encode_ 


cod_amr_state 


cod_amrState 


see below in this table 


FrameState 










pre_state 


Pre ProcessState 


see below in this table 




dtx 


Flag 


Is set if DTX functionality is used 




complexityCounter 


int 


Used for wMOPS counting 


Pre_ProcessState 


y2 hi 


Word 16 


filter state, upper word 




y2 lo 


Word 16 


filter state, lower word 




y1 hi 


Word 16 


filter state, upper word 




y1 lo 


Word 16 


filter state, lower word 




xO 


Word 16 


filter state 




x1 


Word 16 


filter state 


cod_amrState 


old_speech 


Word16[320] 


speech buffer 




speech 


Word 16* 


pointer to current frame in old_speech 




p_window 


Word 16* 


pointer to LPC analysis window in old_speech 




p_window_1 2k2 


Word 16* 


pointer to LPC analysis window with no lookahead in 
old_speech (MR122) 




new_speech 


Word 16* 


pointer to the last 160 speech samples in old_speech 




old_wsp 


Word16[303] 


buffer holding spectral weighted speech 




wsp 


Word 16* 


pointer to the current frame in old_wsp 




oldjags 


Word16[5] 


open loop LTP states 




ol_gain_flg 


Word16[2] 


enables open loop pitch lag weighting (MR102) 




old_exc 


Word16[314] 


excitation vector 




exc 


Word 16* 


current excitation 




ai_zero 


Word16[51] 


history of weighted synth. filter followed by zero vector 




zero 


Word 16* 


zero vector 




hi 


Word 16* 


impulse response of weighted synthesis filter 




hvec 


Word16[80] 


zero vector followed by impulse response 




IpcSt 


IpcState 


see below in this table 




IspSt 


IspState 


see below in this table 




cILtpSt 


cILtpState 


see below in this table 




gainQuantSt 


gainQuantState 


see below in this table 




pitchOLWghtSt 


pitchOLWghtState 


see below in this table 




tonStabSt 


tonStabState 


see below in this table 




vadSt 


vadStatel 


see below in this table 




vadSt 


vadState2 


see below in this table 




dtx 


Flag 


is set if DTX functionality is used 




dtx_encSt 


dtx encState 


see below in this table 




mem_syn 


Word16[10] 


synthesis filter memory 




mem_w0 


Word16[10] 


weighting filter memory (applied to error signal) 




mem_w 


Word16[10] 


weighting filter memory (applied to input signal) 




mem_err 


Word16[50] 


filter memory for production of error vector 




error 


Word 16* 


error signal (input minus synthesized speech) 




sharp 


Word 16 


pitch sharpening gain 


vadStatel 


bckr_est 


Word16[9] 


background noise estimate 




avejevel 


Word16[9] 


averaged input components for stationary estimation 




oldjevel 


Word16[9] 


input levels of the previous frame 




sub_level 


Word16[9] 


input levels calculated at the end of a frame (lookahead) 




a_data5 


Word16[6] 


memory for the filter bank 




a_data3 


Word16[5] 


memory for the filter bank 




burst_count 


Word 16 


counts length of a speech burst 




hang_count 


Word 16 


hangover counter 




stat_count 


Word 16 


stationary counter 




vadreg 


Word 16 


15 flags for intermediate VAD decisions 




pitch 


Word 16 


15 flags for pitch detection 




tone 


Word 16 


1 5 flags for tone detection 




complex_high 


Word 16 


flags for complex detection 




complexjow 


Word 16 


flags for complex detection 




oldlag_count 


Word 16 


variables for pitch detection 




oldlag 


Word 16 


variables for pitch detection 




complex_hang_count 


Word 16 


complex hangover counter, used by VAD 




complex_hang_timer 


Word 16 


hangover initiator, used by CAD 




best_corr_hp 


Word 16 


filtered value 




speech_vad_decision 


Word 16 


final decision 




complex_warning 


Word 16 


complex background warning 




sp_burst_count 


Word 16 


counts length of a speech burst incl HO addition 




corr_hp_fast 


Word 16 


filtered value 






(continued) 
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Table 7 (concluded): Speech encoder static variables 



Struct name 


Variable 


Type[Length] 


Description 


vadState2 


pre_emp_mem 

update_cnt 

hyster_cnt 

last_update_cnt 

ch_enrg_long_db 

Lframe_cnt 

Lch_enrg 

Lch_noise 

last_normb_shift 

tsnr 

hangover 

burstcount 

fupdatejiag 

negSNRvar 

negSNRbias 

shift state 

L_R0 

L Rmax 

LTP flag 


Word 16 
Word 16 
Word 16 
Word 16 
Word16[16] 
Word32 
Word32[16] 
Word32[16] 
Word 16 
Word 16 
Word 16 
Word 16 
Word 16 
Word16 
Word 16 
Word16 
Word32 
Word32 
Flag 


input pre-emphasis memory 

noise update counter 

hysteresis counter 

noise update counter value for last frame 

long term channel energy in dB 

10 ms frame counter 

channel energy estimate 

channel noise estimate 

block shift factor for last frame, used for pre_emp_mem 

total estimated peak SNR in dB 

VAD hangover 

number of consecutive voice active frames 

A flag to control a forced update of the noise estimate 

SNR variability 

sensitivity bias 

indicates scaling state of channel energy estimate 

LTP energy 

LTP max correlation 

set when open loop pitch prediction gain > threshold 


dtx_encState 


lsp_hist 
log_en_hist 
hist_ptr 
log_en_index 

i nit_lsf_vq_i ndex 

Ispjndex 

dtxHangoverCount 

decAnaElapsedCount 


Word16[80] 
Word16[8] 
Word 16 
Word 16 

Word 16 
Word16[3] 
Word16 
Word 16 


LSP history (8 frames) 
logarithmic frame energy history (8 frames) 
pointer to the cyclic history vectors 
Index for logarithmic energy 

initial index for Isf predictor 
Isp indecies to the three code books 
is decreased in DTX hangover period 
counter for elapsed speech frames in DTX 


IpcState 


LevinsonSt 


LevinsonState 


see below 


LevinsonState 


old A 


Word16[11] 


last frames direct form coefficients 


IspState 


lsp_old 
Isp old q 
qSt 


Word16[10] 
Word16[10] 
Q plsfState 


old LSP vector 

old quantized LSP vector 

see below in this table 


Q_plsfState 


past_rq 


Word16[10] 


past quantized LSF prediction error 


cILtpState 


pitchSt 


Pitch frState 


see below in this table 


tonStabState 


count 

gp 


Word 16 
Word16[7l 


count consecutive (potential) resonance frames 
pitch gain history 


Pitch frState 


TO prev subframe 


Word 16 


integer, pitch lag of previous subframe 


gainQuantState 


sfO_exp_gcodeO 

sfO_frac_gcodeO 

sfO_exp_target_en 

sfO_frac_target_en 

sfO_exp_coeff 

sfO_frac_coeff 

gain_idx_ptr 

gc_predSt 

gc_predUncSt 

adaptSt 


Word 16 
Word 16 
Word 16 
Word 16 
Word16[5] 
Word16[5] 
Word 16* 
gc_predState 
gc_predState 
GainAdaptState 


subframe 0/2 codebook gain exponent 
subframe 0/2 codebook gain fraction 
subframe 0/2 target energy exponent 
subframe 0/2 target energy fraction 
subframe 0/2 energy coefficient exponents 
subframe 0/2 energy coefficient fractions 
pointer to gain index value in parameter frame 
see below in this table 
see below in this table 
see below in this table 


gc_predState 


past_qua_en 

past qua en MR122 


Word16[4] 
Word16[4] 


MA predictor memory (20*log10(pred. error)) 

MA predictor memory, 12.2 style (log2(pred. error)) 


GainAdaptState 


onset 

prev_alpha 
prev_gc 
Itpg mem 


Word 16 
Word 16 
Word 16 
Word16[5] 


onset counter 
previous adaptor output 
previous codebook gain 
pitch gain history 


pitchOLWghtState 


old_T0_med 

ada_w 

wghtjlg 


Word 16 
Word 16 
Word 16 


weighted open loop pitch lag 

weigthing level depeding on open loop pitch gain 

switches lag weighting on and off 
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Table 8: Speech decoder static variables 



Struct name 


Variable 


Type[Length] 


Description 


Speech_Decode_FrameSt 
ate 


decoder_amrState 


Decoder_amrState 


see below in this table 


post_state 


Post_FilterState 


see below in this table 




postHP_state 


Post_ProcessState 


see below in this table 




ComplexityCounter 


int 


Used for wMOPS counting 


Decoder_amrState 


old_exc 


Word16[194] 


excitation vector 




exc 


Word 16* 


current excitation 




lsp_old 


Word16[10] 


LSP vector of previous frame 




mem_syn 


Word16[10] 


synthesis filter memory 




sharp 


Word 16 


pitch sharpening gain 




old_T0 


Word 16 


pitch sharpening lag 




prev_bf 


Word 16 


previous value of "bad frame" flag 




prev_pdf 


Word 16 


previous value of "pot. dangerous frame" flag 




state 


Word 16 


ECU state (0..6) 




excEnergyHist 


Word16[9] 


excitation energy history 




TOJagBuff 


Word 16 


received pitch lag for ECU 




inBackgroundNoise 


Word16 


background noise flag 




voicedHangover 


Word 16 


hangover flag 




ItpGainHistory 


Word16[9] 


pitch gain history 




background_state 


Bgn_scdState 


see below in this table 




Cb_gain_averState 


Cb_gain_averageState 


see below in this table 




Isp avg st 


Isp avgState 


see below in this table 




IsfState 


D_plsfState 


see below in this table 




ec_gain_p_st 


ec_gain_pitchState 


see below in this table 




ec_gain_c_st 


ec_gain_codeState 


see below in this table 




pred_state 


gc predState 


see table 7 




nodataSeed 


Word 16 


seed for CN generator 




ph_disp_st 


ph_dispState 


see below in this table 




dtxDecoderState 


dtx decState 


see below in this table 


dtx_decState 


since_last_sid 


Word 16 


number of frames since last SID frame 




true_sid_period_inv 


Word 16 


inverse of true SID update rate 




log_en 


Word 16 


logarithmic frame energy 




old_log_en 


Word 16 


previous value of log_en 




L_pn_seed_rx 


Word32 


random number generator seed 




Isp 


Word16[10] 


LSP vector 




lsp_old 


Word16[10] 


previous LSP vector 




lsf_hist 


Word16[80] 


LSF vector history (8 frames) 




lsf_hist_ptr 


Word 16 


index to beginning of LSF history 




lsf_hist_mean 


Word16[80] 


mean-removed LSF history (8 frames) 




log_pg_mean 


Word 16 


mean-removed logarithmic prediction gain 




log_en_hist 


Word16[8] 


logarithmic frame energy history 




log_en_hist_ptr 


Word 16 


index to beginning of log, frame energy history 




log_en_adjust 


Word 16 


mode-dependent frame energy adjustment 




dtxHangoverCount 


Word 16 


counts down in hangover period 




decAnaElapsedCount 


Word 16 


counts elapsed speech frames after DTX 




sidjrame 


Word 16 


flags SID frames 




valid_data 


Word 16 


flags SID frames containing valid data 




dtxHangoverAdded 


Word 16 


flags hangover period at end of speech 




dtxGlobalState 


enum DTXStateType 


DTX state flags 




data_updated 


Word 16 


flags CNI updates 


Bgn_scdState 


frameEnergyHist 


Word16[60] 


history of synthesis frame energy 




bgHangover 


Word 16 


number of frames since last speech frame 


Cb_gain_averageState 


cbGainHistory 


Word16[7] 


codebook gain history 




hangVar 


Word 16 


counts length of talkspurt in subframes 




hangCount 


Word 16 


number of subframes since last talkspurt 


lsp_avgState 


lsp_meanSave 


Word16[10] 


averaged LSP vector 


D_plsfState 


past_r_q 


Word16[10] 


past quantized LSF prediction vector 




past_lsf_q 


Word16[10] 


past dequantized LSF vector 


ec_gain_pitchState 


pbuf 


Word16[5] 


pitch gain history 




past_gain_pit 


Word 16 


previous pitch gain (limited to 1 .0) 




prev_gp 


Word 16 


previous good pitch gain 


ec_gain_codeState 


gbuf 


Word16[5] 


codebook gain history 




past_gain_code 


Word 16 


previous codebook gain 




prev_gc 


Word 16 


previous good codebook gain 






(continued) 
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Table 8 (concluded): Speech decoder static variables 



Struct name 


Variable 


Type[Length] 


Description 


ph_dispState 


gainMem 

prevState 

prevCbGain 

lockFull 

onset 


Word16[5] 
Word 16 
Word 16 
Word 16 
Word 16 


pitch gain history 

previously used impulse response 

previous codebook gain 

force maximum phase dispersion 

onset counter 


Post_FilterState 


res2 

mem_syn_pst 
synth_buf 
agc_state 
preemph state 


Word16[40] 

Word16[10] 

Word16[170] 

agcState 

preemphasisState 


LP residual 

synthesis filter memory 
synthesis filter work area 
see below in this table 
see below in this table 


agcState 


past_gain 


Word 16 


past age gain 


preemphasisState 


mem pre 


Word 16 


filter state 


Post_ProcessState 


y2 hi 
y2 lo 
y1 hi 
y1 lo 
xO 
x1 


Word 16 
Word 16 
Word 16 
Word 16 
Word 16 
Word 16 


filter state, upper word 
filter state, lower word 
filter state, upper word 
filter state, lower word 
filter state 
filter state 



Homing procedure 



The principles of the homing procedures are described in [2]. This specification only includes a detailed description of 
the 8 decoder homing frames. For each AMR codec mode, the corresponding decoder homing frame has a fixed set of 
speech parameters shown in table 9a-9h. The bit allocation within these parameters is identical to the corresponding bit 
allocation of the source encoder output parameters given in [2]. 

In the following tables, the following naming convention is used for the individual parameters. Letters in italics indicate 
numbers. 



LPC_n 
LTP-LAG m 
LTP-GAIN m 
FCB-GAIN m 
GAIN_VQ m 
POS m_n 
POS m_n_k 
POS m_n_k_l_j 
SIGN m_n_k 
SIGN m_ n_k_lj 
SIGN m n k POS m n 



index of nth LSF submatrix 

adaptive codebook index for subframe m 

adaptive codebook gain index in subframe m 

fixed codebook gain index in subframe m 

codebook gain VQ index in subframe m (subframe m and m+1 for MR475) 

position index of nth pulse in subframe m 

position index of nth and A:th pulse in subframe m 

position index of nth, kth, fth, and jth pulse in subframe m 

sign information for nth and A:th pulse in subframe m 

sign information for nth, A:th, fth, and j'th pulse in subframe m 

sign information for nth and kth pulse and position index for nth pulse in subframe m 
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Table 9a: Parameter values for the decoder homing frame (MR475) 



Parameter 


Value (LSB=b0) 


LPC1 


0x00F8 


LPC2 


0x009D 


LPC3 


0x001 C 


LTP-LAG 1 


0x0066 


POS 1 1 2 


0x0000 


SIGN 1 1 2 


0x0003 


GAIN-VQ 1 


0x0028 


LTP-LAG 2 


OxOOOF 


POS 2 1 2 


0x0038 


SIGN 2 1 2 


0x0001 


LTP-LAG 3 


OxOOOF 


POS 3 1 2 


0x0031 


SIGN 3 1 2 


0x0002 


GAIN-VQ 3 


0x0008 


LTP-LAG 4 


OxOOOF 


POS 4 1 2 


0x0026 


SIGN 4 1 2 


0x0003 



Table 9b: Parameter values for the decoder homing frame (MR515) 



Parameter 


Value (LSB=b0) 


LPC1 


0x00F8 


LPC2 


0x009D 


LPC3 


0x001 C 


LTP-LAG 1 


0x0066 


POS 1 1 2 


0x0000 


SIGN 1 1 2 


0x0003 


GAIN-VQ 1 


0x0037 


LTP-LAG 2 


OxOOOF 


POS 2 1 2 


0x0000 


SIGN 2 1 2 


0x0003 


GAIN-VQ 2 


0x0005 


LTP-LAG 3 


OxOOOF 


POS 3 1 2 


0x0037 


SIGN 3 1 2 


0x0003 


GAIN-VQ 3 


0x0037 


LTP-LAG 4 


OxOOOF 


POS 4 1 2 


0x0023 


SIGN 4 1 2 


0x0003 


GAIN-VQ 4 


0x001 F 
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Table 9c: Parameter values for the decoder homing frame (MR59) 



Parameter 


Value (LSB=b0) 


LPC1 


0x00F8 


LPC2 


0x00E3 


LPC3 


0x002F 


LTP-LAG 1 


OxOOBD 


POS 1 1 2 


0x0000 


SIGN 1 1 2 


0x0003 


GAIN-VQ 1 


0x0037 


LTP-LAG 2 


OxOOOF 


POS 2 1 2 


0x0001 


SIGN 2 1 2 


0x0003 


GAIN-VQ 2 


OxOOOF 


LTP-LAG 3 


0x0060 


POS 3 1 2 


0x00F9 


SIGN 3 1 2 


0x0003 


GAIN-VQ 3 


0x0037 


LTP-LAG 4 


OxOOOF 


POS 4 1 2 


0x0000 


SIGN 4 1 2 


0x0003 


GAIN-VQ 4 


0x0037 



Table 9d: Parameter values for the decoder homing frame (MR67) 



Parameter 


Value (LSB=b0) 


LPC1 


0x00F8 


LPC2 


0x00E3 


LPC3 


0x002F 


LTP-LAG 1 


OxOOBD 


POS 112 3 


0x0002 


SIGN 112 3 


0x0007 


GAIN-VQ 1 


0x0000 


LTP-LAG 2 


OxOOOF 


POS 2 1 2 3 


0x0098 


SIGN 2 12 3 


0x0007 


GAIN-VQ 2 


0x0061 


LTP-LAG 3 


0x0060 


POS 3 1 2 3 


0x05C5 


SIGN 3 12 3 


0x0007 


GAIN-VQ 3 


0x0000 


LTP-LAG 4 


OxOOOF 


POS 4 1 2 3 


0x0318 


SIGN 4 12 3 


0x0007 


GAIN-VQ 4 


0x0000 



ETSI 



(3G TS 26.073 version 3.0.0 Release 1999) 



20 



ETSI TS 126 073 V3.0.0 (2000-01) 



Table 9e: Parameter values for the decoder homing frame (MR74) 



Parameter 


Value (LSB=b0) 


LPC1 


0x00F8 


LPC2 


0x00E3 


LPC3 


0x002F 


LTP-LAG 1 


OxOOBD 


POS 112 3 4 


0x0006 


SIGN 112 3 4 


OxOOOF 


GAIN-VQ 1 


0x0000 


LTP-LAG 2 


0x001 B 


POS 2 12 3 4 


0x0208 


SIGN 2 12 3 4 


OxOOOF 


GAIN-VQ 2 


0x0062 


LTP-LAG 3 


0x0060 


POS 3 12 3 4 


0x1 BA6 


SIGN 3 12 3 4 


OxOOOF 


GAIN-VQ 3 


0x0000 


LTP-LAG 4 


0x001 B 


POS 4 12 3 4 


0x0006 


SIGN 4 12 3 4 


OxOOOF 


GAIN-VQ 4 


0x0000 



Table 9f: Parameter values for the decoder homing frame (MR795) 



Parameter 


Value (LSB=b0) 


LPC1 


0x00C2 


LPC2 


0x00E3 


LPC3 


0x002F 


LTP-LAG 1 


OxOOBD 


POS 112 3 4 


0x0006 


SIGN 112 3 4 


OxOOOF 


LTP-GAIN 1 


OxOOOA 


FCB-GAIN 1 


0x0000 


LTP-LAG 2 


0x0039 


POS 2 12 3 4 


0x1 C08 


SIGN 2 12 3 4 


0x0007 


LTP-GAIN 2 


OxOOOA 


FCB-GAIN 2 


OxOOOB 


LTP-LAG 3 


0x0063 


POS 3 12 3 4 


0x1 1A6 


SIGN 3 12 3 4 


OxOOOF 


LTP-GAIN 3 


0x0001 


FCB-GAIN 3 


0x0000 


LTP-LAG 4 


0x0039 


POS 4 12 3 4 


0x09A0 


SIGN 4 12 3 4 


OxOOOF 


LTP-GAIN 4 


0x0002 


FCB-GAIN 4 


0x0001 
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Table 9g: Parameter values for the decoder homing frame (MR102) 



Parameter 


Value (LSB=b0) 


LPC1 


0x00F8 


LPC2 


0x00E3 


LPC3 


0x002F 


LTP-LAG 1 


0x0045 


SIGN 1 1 5 


0x0000 


SIGN 1 2 6 


0x0000 


SIGN 1 3 7 


0x0000 


SIGN 1 4 8 


0x0000 


POS 112 5 


0x0000 


POS 13 6 7 


0x0000 


POS 1 4 8 


0x0000 


GAIN-VQ 1 


0x0000 


LTP-LAG 2 


0x001 B 


SIGN 2 1 5 


0x0000 


SIGN 2 2 6 


0x0001 


SIGN 2 3 7 


0x0000 


SIGN 2 4 8 


0x0001 


POS 2 12 5 


0x0326 


POS 2 3 6 7 


OxOOCE 


POS 2 4 8 


0x007E 


GAIN-VQ 2 


0x0051 


LTP-LAG 3 


0x0062 


SIGN 3 1 5 


0x0000 


SIGN 3 2 6 


0x0000 


SIGN 3 3 7 


0x0000 


SIGN 3 4 8 


0x0000 


POS 3 12 5 


0x01 5 A 


POS 3 3 6 7 


0x0359 


POS 3 4 8 


0x0076 


GAIN-VQ 3 


0x0000 


LTP-LAG 4 


0x001 B 


SIGN 4 1 5 


0x0000 


SIGN 4 2 6 


0x0000 


SIGN 4 3 7 


0x0000 


SIGN 4 4 8 


0x0000 


POS 4 12 5 


0x01 7C 


POS 4 3 6 7 


0x0215 


POS 4 4 8 


0x0038 


GAIN-VQ 4 


0x0030 
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Table 9h: Parameter values for the decoder homing frame (MR122) 



Parameter 


Value (LSB=b0) 


LPC1 


0x0004 


LPC2 


0x002A 


LPC3 


OxOODB 


LPC4 


0x0096 


LPC5 


0x002A 


LTP-LAG 1 


0x0156 


LTP-GAIN 1 


OxOOOB 


SIGN 1 1 6 POS 1 1 


0x0000 


SIGN 1 2 7 POS 1 2 


0x0000 


SIGN 1 3 8 POS 1 3 


0x0000 


SIGN 1 4 9 POS 1 4 


0x0000 


SIGN 1 5 10 POS 1 5 


0x0000 


POS1 6 


0x0000 


POS1 7 


0x0000 


POS1 8 


0x0000 


POS1 9 


0x0000 


POS1 10 


0x0000 


FCB-GAIN 1 


0x0000 


LTP-LAG 2 


0x0036 


LTP-GAIN 2 


OxOOOB 


SIGN 2 1 6 POS 2 1 


0x0000 


SIGN 2 2 7 POS 2 2 


OxOOOF 


SIGN 2 3 8 POS 2 3 


OxOOOE 


SIGN 2 4 9 POS 2 4 


OxOOOC 


SIGN 2 5 10 POS 2 5 


OxOOOD 


POS 2 6 


0x0000 


POS 2 7 


0x0001 


POS 2 8 


0x0005 


POS 2 9 


0x0007 


POS 2 10 


0x0001 


FCB-GAIN 2 


0x0008 


LTP-LAG 3 


0x0024 


LTP-GAIN 3 


0x0000 


SIGN 3 1 6 POS 3 1 


0x0001 


SIGN 3 2 7 POS 3 2 


0x0000 


SIGN 3 3 8 POS 3 3 


0x0005 


SIGN 3 4 9 POS 3 4 


0x0006 


SIGN 3 5 10 POS 3 5 


0x0001 


POS 3 6 


0x0002 


POS 3 7 


0x0004 


POS 3 8 


0x0007 


POS 3 9 


0x0004 


POS 3 10 


0x0002 


FCB-GAIN 3 


0x0003 


LTP-LAG 4 


0x0036 


LTP-GAIN 4 


OxOOOB 


SIGN 4 1 6 POS 4 1 


0x0000 


SIGN 4 2 7 POS 4 2 


0x0002 


SIGN 4 3 8 POS 4 3 


0x0004 


SIGN 4 4 9 POS 4 4 


0x0000 


SIGN 4 5 10 POS 4 5 


0x0003 


POS 4 6 


0x0006 


POS 4 7 


0x0001 


POS 4 8 


0x0007 


POS 4 9 


0x0006 


POS 4 10 


0x0005 


FCB-GAIN 4 


0x0000 
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6 File formats 

This section describes the file formats used by the encoder and decoder programs. The test sequences defined in [1] also 
use the file formats described here. 

6.1 Speech file (encoder input / decoder output) 

Spech files read by the encoder and written by the decoder consist of 16-bit words where each word contains a 13-bit, 
left aligned speech sample. The byte order depends on the host architecture (e.g. MSByte first on SUN workstations, 
LSByte first on PCs etc.). Both the encoder and the decoder program process complete frames (of 160 samples) only. 

This means that the encoder will only process n frames if the length of the input file is n*160 + k words, while the 
files produced by the decoder will always have a length of n*160 words, 

6.2 Mode control file (encoder input) 

The encoder program can optionally read in a mode control file which specifies the encoding mode for each frame of 
speech processed. The file is a text file containing one line per speech frame. Each line contains one of the mode names 
from the list {MR475, MR515, MR59, MR67, MR74, MR795, MR102, MR122}. 

6.3 Parameter bitstream file (encoder output / decoder input) 

The files produced by the speech encoder/expected by the speech decoder contain an arbitrary number of frames in the 
following format: 



FRAME_TYPE 


Bl 


B2 




B244 


MODE_INFO 


unusedl 




unused4 



Each box corresponds to one Wordl 6 value in the bitstream file, for a total of 250 words or 500 bytes per frame. The 
fields have the following meaning: 



FRAME_TYPE 


transmit frame 


type, 




TX_SPEECH 


(0x0000) 




TX_SID_FIRST 


(0x0001) 




TX_SID_UPDATE 


(0x0002) 




TX_NO_DATA 


(0x0003) 



which 



is 



one 



of 



S0...B244 



speech encoder parameter bits (i.e. the bitstream itself). Each Bx either has the 
value 0x0000 or 0x0001. Only mode MR122 really uses all 244 bits; for the other 

modes, only the first n bits are used (35 < n < 204). The remaining bits are unused 
(written as 0x0 0) 



MODE_INFO 


encoding 


mode 


information 




MR4 7 5 




(0x0000) 




MR515 




(0x0001) 




MR5 9 




(0x0002) 




MR67 




(0x0003) 




MR7 4 




(0x0004) 




MR7 95 




(0x0005) 




MR102 




(0x0006) 




MR12 2 




(0x0007) 


unusedl... 4 


unused, written as 0x0000 





which 



is 



one 



of 
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As indicated in section 6.1 above, the byte order depends on the host architecture. 



Annex A (informative): 
Change Request History 
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